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In this paper we study a simple cascading process in a structured heterogeneous population, 
namely, a network composed of two loosely coupled communities. We demonstrate that under certain 
conditions the cascading dynamics in such a network has a two-tiered structure that characterizes 
activity spreading at different rates in the communities. We study the dynamics of the model using 
both simulations and an analytical approach based on annealed approximation, and obtain good 
agreement between the two. Our results suggest that network modularity might have implications 
in various applications, such as epidemiology and viral marketing. 

PACS numbers: 



I. INTRODUCTION 

Networks are useful paradigm for studying complex 
systems composed of large numbers of interconnected 
components [TJ E] ■ There has been a growing interest 
in applying network analysis to examine various biolog- 
ical [1 E], ecological [SHE], technological [9l \TU\ fTT|. 
and even political [12] systems. Research on various sta- 
tistical properties of such networks has revealed many 
interesting phenomena. For instance, the scale-free de- 
gree distributions observed in many real-world networks 
have significant implications for various dynamical pro- 
cesses on such networks. In particular, the dynamics of 
SIR (susceptible-infected-removed) epidemic processes 
in certain scale-free networks are characterized by van- 
ishing threshold for epidemics [13] , in sharp contrast with 
results for the random Erdos-Renyi networks. 

Another interesting property of networks is modular- 
ity, the tendency of nodes to partition themselves into 
communities |14l 115] . Loosely speaking, a community is 
a group of nodes for which the density of links within a 
group is higher than across the groups. Much recent re- 
search has focused on methods for detecting and analyz- 
ing community structure in networks (for a recent review 
of existing approaches see |16j and references therein). 
However, the dynamical properties of modular networks 
have received relatively little attention despite the po- 
tential importance of the subject to problems such as 
epidemiology, viral marketing, and so on. Consider, for 
instance, word-of-mouth (or viral) marketing of a new 
product. If different consumer groups have different rat- 
ing criteria for the product, or different reaction to mar- 
keting strategies, then one needs to model how influ- 
ence propagates within and across communities to pre- 
dict whether the product will be a hit, or confined to 
a small subset of consumers. A similar argument holds 
for epidemic models where nodes are heterogeneous with 
respect to their susceptibilities and/or interactions pat- 
terns pTj [HJ [19] . For instance, Gupta et. al. showed 
that the transient dynamics of sexually transmitted in- 



fection can be very different, depending on whether the 
network of sexual contacts is assortative or disassorta- 
tive [17] . More recent work has addressed the role of the 
modularity on dynamics of cascading failures in scale free 
networks [20) , and synchronization patterns of networked 
oscillators [21] . In particular, Ref. [21] demonstrated that 
the modularity of networks is strongly reflected in their 
synchronization dynamics, so that communities emerge 
as connected groups of synchronized oscillators. 

The goal of this paper is to further improve our under- 
standing of the connection between network modularity 
and its dynamics. Specifically, we examine the effects 
of modularity on a simple, threshold-based activation 
process on networks. Starting with a modified version 
of Watts' cascading model [22], we study its dynamical 
properties for networks composed of two loosely coupled 
communities. Our main observation is that if the initially 
active nodes (seeds) are contained in one of the communi- 
ties, then under certain conditions the cascading process 
has a two-tiered structure, that is, the peaks of the acti- 
vation dynamics in each community are well separated in 
time. We present results of simulations as well as analyti- 
cal results based on annealed approximation, and observe 
a good agreement between the two. 



II. MODEL 

Let us consider a network where each node is in one of 
two states: passive and active. Initially, all but a small 
fraction of seed nodes are passive. During the cascade 
process, a passive node will be activated with probability 
that depends on the state of its neighborhs. In Watt's 
original model [22] this probability is p = Q(hi/ki — <f), 
where is the step function, hi and fc, are the number of 
active neighbors and the total number of the neighboring 
nodes, respectively, and <fii is the activation threshold for 
the z-th node. Here we consider a slight modification of 
the original model by using a threshold condition on the 
number of active neighbors rather than their fraction: 
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p = r 0(/i, — Hi), where r determines the time-scale 
of the activation process. For the sake of simplicity, we 
assume that all nodes have the same activation threshold, 
Hi = H for all i. 

Clearly, the dynamics of the cascade process will de- 
pend on both network structure and the threshold pa- 
rameter H . Here we are interested in the case when the 
network is composed of two loosely coupled communi- 
ties. Namely, we consider a random graph consisting of 
TV = N a + N b nodes of two different type, a and b. The 
probabilities of edges between nodes of different types are 
laa, 766 and 7ab = 76a , and the average connectivity be- 
tween nodes of the respective types are then z aa = jaaN a , 
m = 766 ^6, Zab = labN b and z ba = j a bN a - We want to 
find out how the modularity of the network, as described 
by the coupling between the groups, affects the cascading 
process. 

Let pi and p® be the fraction of seed nodes in each pop- 
ulation. Further, let P a (h;t) and P b (h;t) be the proba- 
bility distribution that a randomly chosen node of corre- 
sponding type is connected with exactly h active nodes at 
time t. It is easy to see that P a (h; t = 0) and Pb(h; t = 0) 
are Poisson distributions with means z aa p Q a + z a bp\ and 
z bb p b + z baPai respectively. To study the dynamics of 
the process, we need to estimate these distributions for 
later times. This is particularly straightforward to do 
within the annealed approximation, e.g., by "rewiring" 
the network after each iteration [25] . Indeed, since all 
edges of corresponding type are equally likely, it is easy 
to see that P a (h;t) and P b (h;t) are still given by Pois- 
son distribution, with the means that now depend on 
the fraction of active nodes p a (t) and Pb{t): P a , b (h;t) = 
Poisson(X a ,b(t)), where A Q = z aa p a (t) + z ab p b (t) and 
h = z bb p b (t) + z ba p a (t). 

On the first step of the cascading process, the 
fraction of activated nodes of each type is given by 
t -1 Y^h>H Pa,b(h;t = 0). In later iterations, we can cal- 
culate the fraction of active nodes as follows. Let us con- 
sider, for instance, a nodes. There are N a (l — p a (t)) pas- 
sive nodes at time t, and each one of these nodes will be 
activated with the rate t -1 J2h>H Pa{h; t). Also, due to 
the rewiring, some of the N a (p a (t) — p° a ) active nodes will 
switch to passive state with the rate t _1 Ylh<H Paih; t). 
We note that the seed nodes never de-activate. Combin- 
ing these together, and using the normalization condition 
^2 h=Q P a ,b{h; t) = 1, we obtain the in the continuos time 
limit 

T d ^ = l- Pa ,b-(l-p° a ,b)Q(H;Xa, b ) (i) 

where Q(n,x) = X)fc<n e~ x x k /kl is the regularized 
gamma function. 

Equations [T] determine the time evolution of the cas- 
cading process in each group. Let p(t) = ap a {t) + (1 — 
a)p b (t), a — N a /(N a + Nb), be the fraction of active 
nodes in the whole network. In Figure Jl] we compare 
the solutions obtained from Equations with the re- 
sults of simulations on randomly generated graphs for the 
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FIG. 1: Analytical (solid lines) and simulation (circles) re- 
sults for the activation dynamics. The upper panel shows 
the fraction of active nodes vs time for threshold parameter 
H = 2 and H = 4. The lower panel shows the activation rate 
dp/dt vs time for H = 4. 



same network parameters but two different values of the 
threshold parameter. The parameters of the network are 
N a = 5000, N b = 15000, z aa = z bb = 15, z ab = 4. The 
fraction of seed nodes is p° = 0.1, and r _1 = 0.1. The 
simulations are averaged over 100 random realizations. 

The agreement between the analytical prediction and 
results of the simulations is quite good. The network 
settles to the same steady state for both values of the 
threshold parameter H: that is, all of the nodes are acti- 
vated at the end of the cascading process. However, the 
transient dynamics depend on the threshold parameter 
H. For H — 2, activation spreads very quickly through 
both communities and after a short interval all of the 
nodes are activate. For H = 4, on the other hand, the 
fraction of active nodes seems to saturate, then, in later 
iterations, p(t) increases rapidly and eventually all the 
nodes become active. In Figure [ljb) we plot the rate of 
activation process dp/dt vs time for H = 4. Apparently, 
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the peak rates of activation in the two communities are 
separated in time. We call this phenomenon two-tiered 
dynamics. We would like to note that previously such 
a multi-peak structure has been observed in Ref. [T7] . 
where the authors studied the impact of different mixing 
patterns on the spread of sexually transmitted infection. 

To better understand how two-tiered dynamics arises, 
we will examine a simplified scenario. Let us assume 
that seed nodes are chosen among a-nodes only, so that 
pi = 0. Further, let us assume that the coupling between 
two populations is not very strong, so that the cascad- 
ing process among a-nodes is not affected by cross-group 
links. Hence, the fraction of active a nodes evolves ac- 
cording to the following equation 



dp a 

T— — — 

dt 

where we have defined 



—Pa + 9a{z a aPa) 



g a (x) = l-(l-p° a MH,x) 



(2) 



(3) 



The fraction of the population that will be activated at 
the end of the cascading process is determined from the 
following equation: 



Pi = 9a{ZaaP 8 a ) 



(4) 



Note that for sufficiently dense networks (i.e., the con- 
nectivity of all nodes is greater than the threshold H) 
p s a = 1 is always a solution. However, it is not always 
the only solution. This is shown graphically in Figure [2] 
where we plot both sides of Equation [4] as a function of 
p s a for two different connectivities. For a given fraction 
of seed nodes the steady-state fraction of active nodes 
is determined by the connectivity z aa . In particular, for 
sufficiently large values of z aa , the only intersection of the 
curve with the line happens at p ~ 1 , aside from exponen- 
tially small correction of order ~ z H ~ 1 e~ Zaa , indicating 
that the activation will spread globally. If one decreases 
z aa , however, other solutions appear as shown by the 
two intersections of p a and g a {z aa p a ) in Figure |] Specif- 
ically, there is a critical value z% a so that for z aa < z c aa 
the cascading dynamics dies out, while for z aa > z c aa it 



spreads throughout the system. Let us define x 



and rewrite Equation [4] as z~^x = g(x). At the criti- 
cal point, the line z~^x must be tangential to g{x). ft 
is then straightforward to demonstrate that the critical 
connectivity is given by 



ZL = WaiXo)} 1 = 



(1-P° a )e- 



(H-l)l 



where xq satisfies the following equation: 

WaC^o) = ga(xo) 



(5) 



(6) 



Equations [5] and [6] determine that critical connectiv- 
ity needed to cause a global cascade among a nodes for a 
given fraction of seed nodes and the threshold parameter 
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FIG. 2: Graphical representation of the Equation [4] Plotted 
are the straight line y = p s a , and the function y — g a (z a apa) 
for two different values of z aa 



H. In Figure[3]we compare the analytical prediction with 
simulation results for H = 2. The simulations were done 
for a graph with 5 x 10 4 nodes, and for 100 random trials. 
Each parameter pair (p®, z aa ) was considered to be above 
the critical line if a global cascade was observed in the 
majority of trials for that parameters. Again, the agree- 
ment of analytical prediction and the simulation results 
are excellent. 

Let us examine the behavior of the critical connectivity 
in the limit of small p® a . The equation [6] can be rewritten 
as 
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(H-l)! 



Po 



i-p8 

Assuming p®,Xo <C 1 we obtain in the leading order 



(7) 



X 



H - 1 



(8) 



Finally, using Equation [5] we obtain the following scaling 
behavior 



Z aa « {PIT 



(9) 



which is demonstrated in the inset of Figure [3] We also 
note that at the critical point the convergence time di- 
verges as T conv oc(z- zla)- 1 / 2 . 

Now consider the cascading dynamics in the second 
group. Initially, there are no active nodes in this group. 
As more and more a nodes are activated, the activation 
will spread to the b nodes for sufficiently large across- 
group connectivity Zb a . The activation dynamics is again 
governed by an equation similar to the Equation. [2] In 
particular, the steady state fraction of active b nodes sat- 
isfies the following equation: 



Pb 



1 - Q(H, z bbP l + A) = g b (z bb p b + A). 



(10) 
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FIG. 3: (Color online) The critical connectivity vs fraction of 
seed nodes for threshold parameter H — 2. The inset shows 
the scaling behavior of z c aa for different H (the numbers in 
parenthesis are the slopes of corresponding lines). 



where A = z ba p®. Clearly, if A is sufficiently large, then 
the cascade will propagate among b nodes independent 
of the within-group connectivity z bb . And vise versa, 
however large the connectivity z bbl there is a critical value 
of \ c a so that for A < A c there will be no cascade among 
the b nodes. Let us define x = z bb p® + A and rewrite the 
steady state equation as follows: 



g - A 

Zbb 



= 9b(x) 



(11) 



Using the same reasoning as for the a nodes, it is easy to 
show that the critical point is given by 



A c = x - z bb g b (x ) 



(12) 



where Xq is the smaller of the roots of the following equa- 
tion: 



9b( x o) = 



1 

Zbb 



(13) 



Note that for p® a = 1 A c is simply the critical across-group 
connectivity z^ b (z bb ) for which the cascade will spread to 
b nodes, assuming that all a nodes have already been acti- 



vated. Hence, equations 12 and 13 implicitly define a crit- 
ical line z^ b (z ba ) on the z bb — z ba plane. Note that on this 
critical line the convergence time of the cascading process 



among the 6-nodes, and consequently the separation of 
two activity peaks, is infinite. For a fixed within-group 
connectivity z bb the two-tiered structure will be present 
provided that z ba is only slightly above the critical line. 
To be more precise, let /5™ ax be the fraction of active a 
nodes that corresponds to the maximum activation rate 
among a nodes. This can be found from Equation [2] by 
differentiating the right hand side with respect to p a and 
setting it to zero, which yields z aa g' a (z aa p" a lax ) = 1. If 
the across — group connectivity is smaller than X c /p a nax , 
then the cascade will not spread to 6-nodes until the rate 
of activation spreading among a nodes starts to decline 
from its peak. Consequently, the two tiered pattern will 
be present for the range A c < z ba < \ c / p™ ax . 



III. SUMMARY 

To summarize, we have considered a simple cascading 
model on a random network consisting of two-loosely 
coupled communities. For a sufficiently weak coupling 
between two communities the dynamics of the activity 
spreading demonstrates two-tiered structure, that is, the 
peak rates of the cascading processes in two communi- 
ties are separated in time. This pattern is reminiscent 
of multi-peak structure of sexually transmitted infection 
dynamics, previously reported in Ref. [T7]. We stud- 
ied this phenomenon both experimentally and theoreti- 
cally using annealed approximation, and obtained a good 
agreement between analytical results and simulations. 
Although our model is for undirected binary graphs, 
generalizations to directed and/or weighted graphs is 
straightforward. Directed models can be relevant if the 
interactions between two nodes are not symmetric. 

The results presented here might have implications in 
problems such as epidemiology, viral marketing, and so 
on. Consider, for example, the problem of minimizing the 
number of seed nodes that will cause a global cascade in 
a given network, or more generally, the problem of max- 
imizing a certain utility function /(Nq, N s ), where A^o is 
the number of seed nodes, and N s is the expected size of 
the cascade. Our results suggest that simple strategies 
that are suitable for homogenous networks (e.g., choos- 
ing nodes with high connectivity, or at random), might 
lead to a sub-optimal solution for networks with strongly 
modular structure. We note, however, that in order to 
assess the implication of our findings in real world prob- 
lems, one needs to generalize the approach developed here 
to more complex networks. 
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